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1. Introduction 

Semidefinite programming is a vast extension of linear programming and has 
a wide range of applications: combinatorial optimization and control theory are 
the most famous ones. Although semidefinite programming has an enormous ex- 
pressive power in formulating convex optimization problems it has a few practical 
drawbacks: Robust and efficient solvers, unlike their counterparts for solving lin- 
ear programs, are currently not available. So it is crucial to exploit the problems' 
structure to be able to perform computations. 

In the last years many results in the area of semidefinite programming were 
obtained for problems which have symmetry. The underlying principle which was 
used here is the following: One simplifies the original semidefinite program which 
is invariant under a group action by applying an algebra isomorphism that maps 
a "large" matrix algebra to a "small" matrix algebra. Then it is sufficient to solve 
the semidefinite program using the smaller matrices. 

The aim of the tutorial is to give a general and explicit procedure to simplify 
semidefinite programs which are invariant under the action of a group. 

A (complex) semidefinite program is an optimization problem of the form 

(1) max{{C,K) : {A^,K) = b„ i = 1, . . . ,n, and K t 0}, 

where Ai G C^^^, and C G C^x^ are Hermitian matrices whose rows and columns 
are indexed by a finite set V, . . . , 5„)* G M" is a given vector and K G C^^^ is 
a variable Hermitian matrix and where y 0" means that K is positive semidef- 
inite. Here (C, K) = tia,ce{C K) denotes the trace inner product between matrices. 
In the following we denote the matrix entry {x,y) of K by K(x,y) instead of the 
more familiar notion K^y This notation will make our treatment of infinite matri- 
ces more natural. 

The punch line of the lecture is the following: Suppose that the semidefinite 
program ([T]) is invariant under a group F of permutations oi V: If K is feasible for 
([T]) then also K^, defined by Ku{x,y) — K{ux,uy), is feasible for all w G F, and 
the objective values of K and coincide. It is crucial to observe that to solve ([T]) 
it suffices to consider only those K which satisfy K{x, y) = K{ux, uy) for all w G F 
because one can symmetrize every optimal solution K to obtain a F-invariant one: 

^^K{ux,uy). 

Now one can apply Bochner's characterization of F-invariant positive semidefinite 
matrices. This is a classical result in harmonic analysis. It says that one can 
represent any F-invariant X by a block-diagonal matrix diag(i^i, . . . , Fi). One has 
the representation 

/ 

K{x,y)^J2^Fk,Zkix,y)), 

k=l 

where Fk are positive semidefinite matrices of size x ruk and Zk{x, y) G C'"''^™'" 
are fixed basis matrices (At this point the notation Zk{x,y) might be confusing, 
it specifies a matrix and not a matrix entry, so it should be better something like 
Z^''^'^ but this looks awkward.). Here the parameters / and mi, . . . ,mi as well as 
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the basis matrices Zfe depend on the group F. The original semidefinite program ([T]) 
simphfies to 



and in some cases this simphfication results into a huge saving. The advantage is 
that instead of dealing with matrices of size \V\ x \V\ one has to deal with block 
diagonal matrices with I block matrices of size mi, . . . ,mi, respectively. In many 
applications the sum mi + ■ ■ ■ + mi is much smaller than \ V\ and in particular many 
practical solvers take advantage of the block structure to speed up the numerical 
calculations. 

In this lecture we develop the theory of "block-diagonalization" of semidefinite 
programs starting from basic principles. We illustrate this symmetry reduction for 
one specific semidefinite program: The theta function for distance graphs on com- 
pact metric spaces. This is a generalization of the Lovasz theta function for finite 
graphs. To focus on this example has the following benefits: It shows the geometric 
core of the process of symmetry reduction. In this context it is natural to connect 
semidefinite programs with harmonic analysis and the theory of group represen- 
tation. In particular this provides the possibility to consider infinite-dimensional 
semidefinite programs which in many cases is a very convenient framework having 
potential for future research. 

Then, this lecture gives the background for the recent development of semi- 
definite programming bounds for combinatorial and geometric packing problems 
initiated by Schrijver [52], and further developed by Laurent [SO], Gijswijt, Schri- 
jver, Tanaka [39], Gijswijt [38j, Bachoc, Vallentin [5], [6], |7], [8], Musin 71J. It also 
gives the background the developments dealing with the more complicated case of 
noncompact metric spaces, see Cohn, Elkies [25], de Oliveira Filho, Vallentin [78] 

We want to stress (once more): 



Then, one important remark: For comments and suggestions concerning these 
notes the author would be very grateful. 



In this section we study the theta function of distance graphs in compact metric 
spaces. These graphs can have infinitely many vertices. The theta function is a 
generalization of the Lovasz theta function for finite graphs, which originally was in- 
troduced by Lovasz in the celebrated paper [64] . The Lovasz theta function gives an 
upper bound for the stability number which one can efficiently compute using semi- 
definite programming. The generalization was studied by Bachoc, Nebe, de Oliveira 
Filho, Vallentin [S]. The generalization also gives an upper bound for the stability 
number which one can compute using semidefinite programming. The main differ- 
ence to the Lovasz theta function is that one has to solve an infinite-dimensional 




The techniques we present here apply to general semidefinite pro- 
grams which are invariant under a symmetry group. 



2. The theta function for distance graphs 
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semidefinite program if the graph has infinitely many vertices. However, we show 
that if the distance graph is symmetric, solving this infinite-dimensional semidefi- 
nite program is feasible. 

Outline of this section: 

In Section 12.11 we provide the necessary definitions from graph theory (stability 
number and distance graphs). We show that finite graphs are distance graphs and 
we present the two main examples: distance graphs on the Hamming cube (related 
to error-correcting codes) and distance graphs on the unit sphere (related to discrete 
geometry) . 

In Section [22] we discuss two possible formulations of the Lovasz theta function. 

Before we can formulate the generalization of the theta function in Section 12.41 
we review an infinite-dimensional generalization of positive semidefinite matrices, 
so-called positive Hilbert-Schmidt kernels, in Section 12.31 There we in particular 
focus on the spectral decomposition theorem which turns out to be central in the 
following. 

In Section 12.51 we show how to exploit the symmetry of distance graphs in or- 
der to simplify the computation of the theta function. For this we introduce the 
automorphism group of a distance graph and explain the essential tool of group 
invariant integration. 

2.1. Distance graphs. We start with some basic definitions from graph theory. 
Let G — (Vj-E) be an undirected graph given by a finite set V of vertices and a 
subset E C of two-element subsets of V called edges. Two vertices x, y with 
{x, y} G E are called adjacent and a family of vertices (xi, . . . , a;„) in which every 
two consequent elements are connected is called a path of length n — I. 

A stable set (sometimes also called independent set) of a graph G is a finite subset 
of the vertex-set in which no two vertices are adjacent. The stability number (or 
independence number) of a graph is the maximum cardinality of a stable set of G: 

a{G) = max{|C| -.CCV, {x,y} ^ E for aU x,y e C}. 

If the graph has infinitely many vertices it may happen that there is no maximum: 
The graph with vertex-set V = S*""^ = G M" : x ■ x = 1}, in which two points 
are adjacent whenever they are orthogonal, has stable sets of arbitrary cardinality. 
In this case, it makes sense to replace the maximal cardinality of a finite stable 
set by the maximal measure of a measurable stable set. Then in case of the circle 

the stability number equals tt. This approach has been worked out by Bachoc, 
Nebe, de Oliveira Filho, Vallentin j9j, but here we will stick to the case when a{G) 
is finite. 

For the definition of a distance graph we use the triple (V,(i, /z). Here y is a 
metric space with the distance function d : V x V ^ R which is equipped with 
the finite, regular Borel measure /x. We assume that V is separable and compact. 
Usually it does not cause confusion if one uses only V to specify the triple {V, d, fi) 
and we refer to V simply as a compact metric space. The adjacency relation of a 
distance graph only depends on the distance map d: 

Definition 2.1. Let V be a compact metric space and let I be a subset of the 
distances which may occur among distinct points in V . The distance graph G{V, I) 
is the graph with vertex-set V and in which two vertices are adjacent if their distance 
lies in I. So the edge-set is E = {{x,y} : d{x,y) E /}. 
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The first example shows that finite graphs are distance graphs. 

Example 2.2. Let G ~ iy, E) be a finite graph. By d : V x V Z>o U {00} we 
denote the length of a shortest path connecting two vertices x and y in G where we 
set d(x, y) = 00 whenever there is no connection at all. If the graph is connected, 
that is if d does not attain the value 00, then d defines a metric on V . The set V 
comes with the uniform measure fi; we have ^i{A) = |^| for any A V . The graph 
G is a distance graph. Two vertices are adjacent whenever their distance is equal 
to one. Hence, the adjacency relation only depends on the distance map. 

The second example comes from engineering, error correcting codes. In fact, it 
can be viewed as a special case of the first example. Error correcting codes have 
great practical importance for communication across noisy channels and for storing 
and retrieving information on media. The idea is that the sender adds redundant 
data to its messages which allows the receiver to detect and correct errors under the 
assumption that there is not unreasonable amount of noise. In particular there is 
no need to ask the sender for additional data. The book Mac Williams, Sloane [53] 
is the definitive reference in algebraic coding theory. 

Example 2.3. Let {0, 1}" be the set of binary strings of length n. It is also called 
the n-dimensional Hamming cube. The Hamming cube is a metric space. The 
distance between two binary strings is measured by the Hamming distance 5, which 
is the number of entries in which they differ: 

(5((xi, . . . ,x„), (yi, . . .,?;„)) = \{i(^{l,--.,n}:xi^ yi}\. 

The Hamming norm (or Hamming weight ) of a vector x G {0, 1}" is \\x\\ = S(x, 0). 

A central parameter is A{n, d), the maximal cardinality of a subset C C {0, 1}" 
such that every two points in it have Hamming distance at least d. In other words, 
A{n, d) is the stability number of the distance graph G({0, 1}", {1, . . . , c? — 1}). A 
stable set in this graph can correct errors. This value has been determined 

for various parameters of n and d, but in general it is unknown to large extend. 
Brouwer j21| maintains a list of lower and upper bounds for A(n, d) for n < 28. 

The third example comes from discrete geometry. 

Example 2.4. We consider the infinite graph G(S'"^^, (0, 9)) whose vertex-set con- 
sists of all the points on the {n — 1)- dimensional unit sphere S"^^ = {x E M" : 
X ■ X = 1}. The distance between two points on the unit sphere can be measured by 
the spherical distance d. For x,y € S"'~^ we have d{x, y) = arccos(a; ■ y), where x ■ y 
denotes the Euclidean inner product. E.g. antipodal points have spherical distance 
d{x, —x) — TT. The unit sphere comes with a measure, the surface area co which is 
induced by the Lesbegue measure on K". We have uj{S"~^) = r{n/2) • 

In the distance graph 0(5""^, (0, 9)) each vertex x is adjacent to a spherical cap 
centered at x. Stable sets of G{S"~^, (0, 9)) are called spherical codes with minimal 
angular distance 9. They are of special interest in information theory. The stability 
number of this graph is also denoted by A{n, 9). 

The kissing number problem is equivalent to the problem of finding A(ri, 7r/3). 
In geometry, the kissing number problem asks for the maximum number r„ of unit 
spheres that can simultaneously touch the unit sphere in n-dimensional Euclidean 
space without pairwise overlapping. The touching points form a spherical code with 
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minimal angular distance 7r/3. The value of t„ is only known for n — 1, 2, 3, 4, 8, 24. 
While its determination for n = 1, 2 is trivial, it is not the case for other values of 
n. The case n = 3 was the object of a famous discussion between Isaac Newton and 
David Gregory in 1694. For a historical perspective of this discussion we refer to 
Casselman [24]. The first valid proof of the fact "T3 — 12", like in the icosahedron 
configuration, was only given in 1953 by Schiitte and van der Waerden [S^ . Odlyzko 
and Sloane [77], and independently Levenshtein [63j . proved rg = 240 and = 
196560 which are respectively the number of shortest vectors in the root lattice E% 
and in the Leech lattice. In 2003, Musin ^70| succeeded to prove the conjectured 
value T4 = 24, which is the number of shortest vectors in the root lattice -D4. See 
also the survey Pfender, Ziegler [73 . The known lower and upper bounds for t„, 
with n < 10, are given in Bachoc, Vallcntin 0. 

2.2. Original formulations for theta. In the original paper [64], the Lovasz 
theta function was given as the solution of a semidefinite program which involves 
a positive semidefinite matrix K e R^^^ whose rows and columns are indexed by 
the finite vertex-set of a graph G = (V, E). In [54, Theorem 3, Theorem 4] Lovasz 
gave the following two formulations of the theta function, which are equivalent by 
semidefinite programming duality: 

i9(G') = max { K{x, y) : K <E M^^^ is positive semidefinite, 

xevyev 

(2) = 1, 

Kix,y)^Oif {x,y}eE}, 

and 

i9(G') = min {A : K G M^^^ is positive semidefinite, 

(3) K{x, a;) = A - 1 for all x eV, 

if(x,2/) = -lif {x,y}^E}. 

There are many equivalent definitions of the theta function. Possible alternatives 
are reviewed by Knuth [58] . One can rephrase ([3]) by saying that A is the minimum 
of the largest eigenvalue of any symmetric matrix K € R^^^ such that K{x, y) = 1 
whenever x = y ot ii x and y are not adjacent. 

Exercise 2.5. Prove this statement. 

It is easy to see that the theta function gives an upper bound for the stability 
number of a finite graph: Let C C T/ be a stable set of maximal cardinality. 
Consider the column vector Ic G which is the characteristic vector of C defined 
by lc(a;) = 1 if a; e C and lc{x) = otherwise. Then, the rank-1 matrix K = 
l/|C|lc(lc)*7 which componentwise is K{x,y) = l/\C\lcix)lc{y), is feasible for 
p. Thus, ^{G) > a{G). 

One can strengthen the equalities in ^ and ^ by the inequalities K{x,y) > 
and K(x, y) < —I respectively. This strengthening was introduced by Schrijver [81j . 
Sometimes it is denoted by i?'(G) and we have i?(G) > z9'(G) > a(G). 

Using semidefinite programming one can compute the theta function in polyno- 
mial time, in the sense that one can approximate it with any given precision. In 
general the upper bound given by the theta function is weak: Feige [361 proved 
that there exists a constant c and an infinite family of graphs on n vertices for 
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which 'd{G)/a{G) > n/2'^^'°8" holds. So in particular it does not give a n^~'^- 
approximation for any fixed e > 0. It is weak for a good reason: Hastad [15] 
showed that for any fixed e > one cannot approximate the stability number of a 
general graph with n vertices within a factor of n^~^ in polynomial time unless any 
problem in NP can be solved in expected, probabilistic polynomial time. However, 
as we will see later, it is sometimes surprisingly good, especially for symmetric 
graphs. 

2.3. Positive Hilbert-Schmidt kernels. To be able to define the theta function 
for infinite graphs we need a concept of positive semidefinite matrices with infin- 
tely many rows and columns. We use positive Hilbert-Schmidt kernels for this, 
which are well-studied objects in functional analysis. Many familiar facts about 
positive semidefinite matrices can be generalized to positive Hilbert-Schmidt ker- 
nels. Historically, the central results of the theory of Hilbert-Schmidt kernels have 
been developed at the beginning of the 20th century mainly by Fredholm, Hilbert, 
Mercer, Schmidt. The classical text books Courant, Hilbert [30] and Riesz, Sz.- 
Nagy [75] are beautiful expositions. A modern treatment is contained for example 
in the comprehensive books by Reed, Simon [74j . 

Let us list some basic properties of positive Hilbert-Schmidt kernels. Here it is 
sometimes enlightening to compare positive Hilbert-Schmidt kernels with the more 
familiar positive semidefinite matrices. 

By C{V) we denote the set of complex-valued continuous functions f : V ^ C 
which is an inner product space by 

(4) (,f,5)= / f{x)'^)dfi{x), 

Jv 

and a normed space by ||/|| = \/(777y- We say that a family of continuous functions 
Ci, 62, . . . £ C(V) is an orthonormal system if 

(e/c, Cfe) = 1 and (cfc, e;) = 0, whenever k ^ I. 

We say that it is complete if every continuous function can be approximated arbi- 
trarily well by finite linear combinations in terms of convergence in the mean, i.e. 
convergence with respect to the norm || • || introduced above. Let ei, 62, . . . , be 
an orthonormal system, then we have the fundamental Bessel's inequality 

d d 

< 11/ - ^feKf = ii/ii' - E !(/' ^^-)r- 

fc=l k=l 

By C{V xV) we denote the set of continuous functions K : V x V ^ The 
elements oiC{V x V) are traditionally called kernels because they appear as integral 
kernels in the theory of integral equations: For instance the homogeneous Fredholm 
integral equation of the second type is an integral equation of the form 

(5) A/(x)- / K{x,y)f{y)d^Ji{y) = Q, 

Jv 

where K G C{V xV) \s a. given integral kernel, and A G C and / G C{V) are to be 
determined. In fact, solutions to ([5]) are given by eigenvalues and eigenf unctions of 
the linear map 

Tk : C{V) ^ C{V), TK{f){x) = [ K{x, y) f {y)diji{y) , 

Jv 
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i.e. nontrivial / G C{V) and A £ C with Tk(/) = A/. 

In the following we only consider symmetric or Hermitian kernels. They satisfy 
K{x, y) = K{y, x) for all x,y eV. A kernel K G C{V x V) is called positive if for 
any nonnegative integer m, any points Xi,. . . ,Xm € V, and any complex numbers 
Ml, . . . , Um, we have 

m m 

K{xi, Xj)uiuj > 0. 

So any finite matrix {K{xi,Xj))i j which one extracts from K has to be positive 
semidefinite. We refer to positive, symmetric, continuous kernels as positive Hilhert- 
Schm,idt kernels or simply as positive kernels. 

Let us turn to the spectral decomposition of positive Hilbert-Schmidt kernels 

K £ C{V X V) where we first recall what happens for positive semidefinite matrices 
A £ C"^" (or in a more snobbish notation A G C({1, . . . , n} x {1, . . . , n})). 

Positive semidefinite matrices A G C"^" can be diagonalized: There is an or- 
thonormal basis ei , . . . , e„ of C" consisting of eigenvectors of A, all eigenvalues 
Ai, . . . , A„ are nonnegative real numbers, and the {x, y) entry of A can be written 
as 

n 

^(2;, y) = XI ^kek{x)ek{y). 
fc=i 

The rank of A is the number of nonzero eigenvalues, counted with multiplicity. 
Suppose that the eigenvalues are ordered in descending order 

Ai > A2 > . . . > A„ > 0, 

then they can be determined from the following max-min characterization: 

Afe = max min (Af, f), 

where Sk runs through all /c-dimensional subspaces of C". Geometrically, the eigen- 
vectors form the principal axes of the ellipsoid {/ G C" : {Af, /) = 1} defined by 
A. The square roots of the reciprocals of the corresponding eigenvalues give the 
length of these axes. 

Essentially the same statements hold true for positive Hilbert-Schmidt kernels. 
The main difference between positive semidefinite matrices and positive kernels is 
that the latter have infinitely many, difii'erent eigenvalues. 

Let K G C{V X y) be a positive Hilbert-Schmidt kernel. To get the flavour of 
the reasoning one needs when working with Hilbert-Schmidt kernels we show using 
Bessel's inequality that the multiplicity of every nonzero eigenvalue is finite: Let 
ei, . . . , Cd be an orthonormal system of eigenfunctions of a nonzero eigenvalue A. 
For fixed a; G V we apply Bessel's inequality to the function y 1— > K{x, y) and get 



d 



I \K{x,y)\''dn{y)>Y^ 

fc=i 



„ 2d 

/ K{x,y)ek{y)dn{y) = V |Aefe(a;)|^. 
Jv 



fe=i 

Integrating both sides again yields an upper bound for d: 



Jv Jv 



\K{x,y)\'d,iiy)di^{x) > \X\'d. 
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Furthermore, one can show that the series of squared eigenvalues converges to 
0. So there are two possibiHties: K has finitely many or infinitely many nonzero 
eigenvalues. In the latter case, the only accumulation point of the eigenvalues is 0. 
The rank of a kernel is the number of nonzero eigenvalues counted with multiplicity. 

We have the spectral decomposition of a positive Hilbert-Schmidt kernel: 

K{x,y) =^\kek{x)ek{y). 

k 

Here Xk are the eigenvalues of which are all nonnegative, and the Ck form a 
complete orthonormal system. In the case of infinitely many different eigenvalues 
the right hand side converges absolutely and uniformly to the left hand side. In the 
case of finitely many different eigenvalues, the finite rank case, the series degenerates 
to a finite sum. In the literature this statement about the spectral decomposition 
of positive Hilbert-Schmidt kernels is often called Mercer's theorem. 

The max-min characterization of the eigenvalues of K reads as follows: Suppose 
that the eigenvalues of K are ordered in descending order 

Ai > A2 > . . . > 0, 

then they can be determined from the following max-min characterization: 

Afc=niax min {TkU),/), 
Sk /eSfc, (/,/)=! 

where Sk runs through all fc-dimcnsional subspace of C(V). In fact the max-min 
characterization of the eigenvalues is used in Courant, Hilbert [30] and Riesz, Sz.- 
Nagy [75] to prove the spectral decomposition theorem. 

Exercise 2.6. Determine the eigenvalues of the inner product kernel K : S"^^ x 
5"^^ — > C defined by K{x,y) — x ■ y. Give the spectral decomposition of it in the 
case n — 2. (Maybe you want to come back to this exercise after Section\^) 

2.4. The theta function for infinite graphs. Now we are ready to generalize the 
theta function to possibly infinite distance graphs G{V, I). It will turn out that for 
the combinatorial and geometric packing problems we consider it is convenient to 
work with the second formulation ([3]) ; possibilities to work with the first formulation 
are considered in Bachoc, Nebe, de Oliveira Filho, Vallentin [9]. In the second 
formulation we simply replace the condition that if is a positive semidefinite matrix 
by the condition that K is a positive kernel. 

-diGiy, /)) = min {A : K e C{V x V) is positive, 

(6) K{x, x) = A - 1 for aU x eV, 

K{x,y)^-lii{x,y}(^E}}. 

When we are working with an infinite-dimensional semidefinite program we can 
no longer use the simple duality argument given in Section 12.21 to show that the 
theta function gives an upper bound for the stability number. So we have to provide 
a new (simple) one: Instead of constructing a feasible kernel from a stable set and 
going to the dual semidefinite program, we show that any feasible kernel can be 
used as a test function on any stable set C to bound the cardinality of C. 

Theorem 2.7. Let G{V, I) be a distance graph. Then, the theta function is an 
upper bound for the stability number: i^{G(V, I)) > a{G{V, I)). 
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Proof. Let C be a stable set of G{V,I) and let K he e. kernel which satisfies the 
conditions in ([6|). We consider the |C| x |C| matrix {K{c,c'))(c.c')ec^ which we 
extract from K. Because this matrix is positive semidefinite we have 

o< ^(^'^')- 

(c,c')ec2 

On the other hand, 

so that |C| — 1 < K{c, c) and the statement follows. □ 

In the same way one can define i9'(G(V,/)) by strengthening the condition 
K{x,y) = 1 to K{x,y) < —1. Then, one has 

diGiV, I)) > d'{G{V, I)) > a{G{V, I)). 

2.5. Exploiting symmetry. We should start to speak about symmetry. In this 
section we show the basic step to simplify the computation of the theta function 
for symmetric distance graphs. Note that this basic step can be applied to other 
semidefinite programs which are invariant under the action of a group. Once one 
has identified this symmetry, this basic step, as well as the following steps, can be 
applied to some extend mechanically. 

Usually many symmetries of a distance graph come from the automorphism group 
of the underlying metric space V. It is the group of all permutations u : V ^ V 
which leave the distance map d invariant: 

Aut{V) — {u -.V ^ V : d{x, y) = d{ux, uy) for all x,y e V}. 

Let us take a look at the automorphism group of the three examples I2.2H2.4I 

Example 2.8. The automorphism group of a metric space defined by the short- 
est paths in a connected graph G — (V, E) equals the automorphism group of the 
underlying graph: 

Aut(G) = {u : y — > y : {x, y} € E if and only if {ux, uy} S E}. 

Computing the automorphism group is not easy. Deciding whether the auto- 
morphism group is trivial is as difficult as the graph isomorphism problem. For 
this no polynomial time algorithm is known. The graph isomorphism problem is 
generally believed to lie in NP n co-NP. So it is unlikely that it is NP-hard. For 
more information on the computational complexity of this problem we refer to the 
book Kobler, Schoning, Toran [59]. On the practical side the program nauty of 
McKay [67] is a very useful tool for computing the automorphism group of a graph. 

Example 2.9. The automorphism group of the Hamming cube {0,1}" has order 
2"7i!. It is generated by all n\ permutations of the n coordinates and all 2" switches 
0^1 which one can identify with the Hamming cube {0,1}" itself where one 
considers addition modulo 2. 

Example 2.10. The automorphism group of the unit sphere is the orthogonal 
group. It is the group of orthogonal matrices 

Aut(S'""i) = 0(M") ^{ue M""^" : u'u = /„}, 

where In is the (n x n) -identity matrix. 
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The orthogonal group is generated by a reflection over some hyperplane and aU 
rotations. It preserves the inner product (and thus the spherical distance) as well 
as the Euclidean distance. Another way to view it is to observe that the elements 
of 0(R") map orthonormal systems of R" to orthonormal systems. 



Crucial observation 



Now we come to a crucial observation: In the computation of the theta function 
([S]) of a distance graph G{V, I) one can restrict the semidefinite program to pos- 
itive Hilbert-Schmidt kernels which are invariant under the automorphism group 
of V. The optimal objective value of this restricted semidefinite program remains 
unchanged, and the matrix sizes which are needed in the practical computation can 
be drastically smaller. 

We say that a positive kernel K E C{V x V) is Aut{V) -invariant if K{ux, uy) ~ 
K{x, y) holds for all u S Aut(y) and all x,y eV . For instance, the positive kernel 
K G C{S^~^ X S^~^) which is defined by the inner product K{x,y) = x ■ y is 
0(M")-invariant. 

If the graph is finite, this crucial observation is easy. If if is a feasible solution 
of ^ so is its Aut(V^)-invariant group average 

(7) K{x,y) = 1^ K{ux,uy). 

' ^ «eAut(y) 

This is also easy to verify, e.g. we have for x,y E V with {x, y} ^ E 

The matrix K is positive semidefinite because it is a nonnegative sum of | Aut(y)| 
positive semidefinite matrices Ku{x,y) ~ K{ux,uy). Also the objective values of 
K and K coincide. 

If the distance graph is infinite we need a replacement for the finite sum ([7]) . For 
this we use the invariant integral of the group Aut(y). If Aut(V^) is finite, then 
the invariant integral is given by the sum ([7]). Generally, this invariant integral is 
defined for compact topological groups and it can be constructed from the Haar 
measure (see e.g. the original Haar |45j . or the standard text on measure theory 
Taylor |88|). Lovasz [65j gives an elementary, combinatorial construction of the 
invariant integral based on the marriage theorem of matching theory. 

Here we only give the defining properties of the invariant integral. For the ex- 
plicit construction we refer the interested reader to the above mentioned literature. 
Actually one could establish the invariant integral in our three examples directly. 

Let r be a compact topological group. A topological group is a group which 
is equipped with a topology so that multiplication and inversion are continuous 
functions. Examples of compact topological groups are all finite groups and the 
orthogonal group 0(R"). The topology here comes from the topology defined on 
(n X n)-matrices. A topological group which is not compact is the group of trans- 
lations (R",-F). 



12 



FRANK VALLENTIN 



We consider the space of complex- valued continuous functions on the group C{T). 
Then there is a unique map /p : C{T) —>■ C, the invariant integral, with the following 
properties: 

(a) Linearity 

j af + I3g ^ a j f + 13 j g for all /, g G C(r), a, /3 G C. 

(b) Monotonicity 

^|/|>0 foraU/GC(r). 

(c) Normalization 




(d) Invariance 

j f{vu) = J f{u) for all v G T. 
Two quick examples: If F is a finite group, the invariant integral simply is 

For the two-dimensional rotation group SO(R^) = {u G O(R^) : detu = 1} the 
invariant integral is 

f f ^ f [ [^°^^ - sin 6'\ \ 

W^) 2^io ^llsin^? coseJJ 

Now the proper replacement of the group average given in ([7]) for general distance 
graphs G{V, I) is 

(8) K{x,y)^ I K{ux,uy), 

JAut(y) 

where we take the invariant integral of the function u K{ux,uy) G C(F). 

3. Harmonic analysis 

In the last section we showed that in the computation of the theta function it 
suffices to consider invariant kernels only. If one chooses an appropriate basis of the 
invariant kernels, then it is possible to express positivity in an efficient way, namely 
as the positive semidefiniteness of a couple of smaller block matrices. To make this 
statement precise we have to use some harmonic analysis which we develop in this 
section starting from basic principles. 

To find an appropriate basis we use basic tools from harmonic analysis and 
representation theory of compact groups which are usually associated with the 
Peter- Weyl theorem. In Section [XT] we present the basic notions and in Section [22] 
we state and prove the Peter- Weyl theorem. In Section 13.31 we apply it to find 
Bochner's characterization of positive, invariant kernels. Throughout this section 
we develop the theory with the help of the spectral theory of positive kernels. 
Then, in the following sections we show how the Peter- Weyl theorem and Bochner's 
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characterization specializes for the computation of the theta functions of distance 
graphs on the Hamming cube and on the unit sphere. 

Literature: The hterature on harmonic analysis and group representations is 
huge. Probably the book by Serre J84j on the linear representation theory of finite 
groups is the most prominent. Other books which emphasize the connection of 
group representations of finite groups to probability is Diaconis [31], one empha- 
sizing the connection to algebraic combinatorics is Sagan [79] , and one connecting 
to chemistry, error-correcting codes, data analysis, graph theory, and probability is 
Terras [8^. Further good sources are: Brocker and tom Dieck [20], Vinberg [93] . 
Carter, Segal, Macdonald [23], Goodman, Wallach [4^. The articles Borel [16], 
Gross 41' and Slodowy [SSj illuminate the historical background. 

3.1. Basic tools. Let us recall the set-up: V is a compact metric space with 
measure /x, and F = Aut{V) is the automorphism group of V. We define the action 
of F on the linear space of complex- valued continuous functions C(V) by 

We assume that the measure fi is F-invariant {ii{uA) — ii{A)), so that we have an 
invariant inner product ([4]) on the space of continuous functions: 

(/, g) = [uf, ug) for all f,ge C{V) and ueT. 

The action of F on the vector space C{V) is linear, that is we have 

u{af + [3g) = auf + Pug for all /, g G C{V), a,(3 E C and u e F. 

Linear actions are called group representations. More vocabulary: A subspace 
S C C{V) is called T -invariant if uS = S for all m € F, i.e. if for every u G F and for 
every / G 5 we have uf G 5 as well. A nonzero subspace S is called T -irreducible if 
{0} and S are the only F-invariant subspaces of S. Let S and S' be two invariant 
subspaces. A linear map T : S ^ S' is called a T-map if T{uf) = uT{f) for all 
w G F, and / G C{V). We say that S and S' are F-equivalent if there is a bijective 
F-map between them. If it is clear from the context which group we consider, we 
frequently omit the prefix "F-" . 

Two little lemmas will make our live easier. 

The first one is also called Maschke's theorem. 

Lemma 3.1. Let S be a invariant subspace and let U C S be an invariant subspace 
of S . Then its orthogonal complement in S , given by 

U^^{geS:i,f,g)^Ofor all f e U} 

is invariant as well. 

Proof. This comes from the invariant inner product: We have for all u G F 

(/, ug) = (m" V, 5) = whenever f eU, geU^. □ 

Maschke's theorem implies that a finite-dimensional invariant subspace which is 
not irreducible splits into an orthogonal sum of irreducible subspaces. 

The second lemma is Schur's lemma. 

Lemma 3.2. Let S and S' be two irreducible subspaces, and let T : S S' be a 
T-map. If S and S' are not equivalent, then T — 0. If they are equivalent then 
either T = or T is bijective. 
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Proof. The kernel of T is an invariant subspace of S, and the image is an invariant 
subspace of 5'. Since S and S' are irreducible subspaces we are left with four 
possibilities: 

(1) The kernel of T is {0} and the image of T is {0}. This could only happen 
when = {0}. 

(2) The kernel of T is {0} and the image of T is S'. Then, T is a bijective 
F-map and S and S' are equivalent subspaces. 

(3) The kernel of T is S" and the image of T is {0}. Then, T must be the zero 
map. 

(4) The kernel of T is S* and the image of T is S' . This could only happen 
when S" = {0}. □ 

We apply Schur's lemma to establish some orthogonality relations which will be 
essential later. 

Lemma 3.3. Let S and S' be two irreducible subspaces. Let ei, . . . , e/i be a complete 
orthonormal system of S and let e'l, . . . , e'^, be one of S' . 

(1) If S and S' are not equivalent, then they are orthogonal to each other: 

{ei, e'j) ^ 0, for i ^ 1, . . . ,h, j = 1, . . . ,h' 

(2) If S and S" are equivalent and if T : S S' is a bijective T-map, mapping 
Ci to e' , then there is a constant c so that 



c ifi=j, 
otherwise. 



Proof. Define the linear map A : S S' by 

h' 

(9) ^(e.) = 5]K-,e,)e;. 

j=i 

This is a T-map. Now the first claim follows immediately from Schur's lemma: We 
have A = Q and so {e'j,ei) — 0. For the second claim we assume A ^ 0. Then 
A is bijective by Schur's lemma. Consider the endomorphism T^^A : 5* — > S", 
T~^A{ei) = Since we work over the complex numbers and T~^A 

is bijective, it has a nonzero eigenvalue c. The corresponding eigenspace is an 
invariant subspace of S, so it has to be equal to S. Hence T~^A is c times the 
identity map, which proves the second claim. □ 

Exercise 3.4. Check that the map A defined in ([9|) is indeed a T-map. 

3.2. The Peter- Weyl theorem. The theorem of Peter and Weyl [75] and [HS] 

is the starting point of harmonic analysis of compact groups. It connect Fourier 
analysis with group representations. It shows that the space C{V) decomposes 
orthogonally into finite-dimensional irreducible subspaces and that the space C{V) 
has a complete orthonormal system which is "in harmony" with the group F — 
knt{V). 

Theorem 3.5. All irreducible subspaces of C{V) are of finite dimension. The space 
C{V) decomposes orthogonally as 



C(V)= i/fe. 



fc=0,l, 
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and the space Hk decomposes orthogonally as 

2— l,2,...,mfc 

where Hk^i is irreducible, and H^.i is equivalent to Hk',i' if and only if k ^ k' . The 
dimension hk of Hk,i is finite, but the multiplicity ruk can potentially be infinite. 

In other words, C{V) has a complete orthonormal system ek,i,i, where k — 
0,1, . . ., i — 1,2, . . . , ruk, I ~ 1, ■ ■ ■ ,hk so that 

(1) the space Hk.i spanned by ek,i,i, ■ ■ ■ , Sk,i,h^. is irreducible, 

(2) the spaces Hkj and Hk'.i' Ou'^e equivalent if and only if k = k' , 

(3) there are T-maps (t>k,i : Hks ~^ Hka mapping eks,i to ek,i,i. 

Proof. Consider an element / G C(V). Define the positive F-invariant kernel K G 
C{V X V) using invariant integration 

K{x,y) ^ J f{ux)f{uy). 

The eigenspaces of Tk of nonzero eigenvalues are finite-dimensional. They are 
invariant because for an eigenfunction g G C{V) to an eigenvalue A we have 

TK{ug){x) = / K{x,y)g{u^^y)dfi{y) 
Jv 

= / K{x,uy)g{y)dp.{y) 
Jv 

= / K{u-^x,y)g{y)d^i{y) 
Jv 

= \g{u^^x) 
= Xug{x). 

Now the statement follows: One breaks the invariant finite-dimensional eigenspaces 
into irreducible subspaces using Maschke's theorem. The orthonormal system one 
constructs using Gram-Schmidt orthonormalization. The orthogonality relation 
follow from Lemma [3.31 Completeness follows form the spectral decomposition of 
positive kernels. □ 

3.3. Bochner's characterization. The complete orthonormal system ek,i,i of the 

Peter- Weyl theorem is very useful to characterize F-invariant, positive kernels. This 
is the contents of the following theorem by Bochner [T5]. 

Theorem 3.6. Let Ck^i^i be a complete orthonormal system for C{V) as in Theo- 
rem \3.5\ Every T -invariant, positive kernel K G C{V x V) can be written as 

(10) K{x,y)= A-.jj 

fe=0,l,... ij = l,2...,mfc 1=1 

or more economically as 

(11) K{x,y)^ J2 (F'^^zt'^)' 

k=Q,l,... 

with {Fk)ij = fk,ij and {zjf'^'')ij = J2'i=iek,i,i{x)ek,j,i{y). Here Fk is Hermitian 
and positive. The series converges absolutely and uniformly. 
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Proof. It is clear the every kernel of the form (fTO|) or (fTTj) is a positive F-invariant 
kernel. Generally, a kernel K G C{V x V) can be written as a series in the basis 
{x,y) 1-^ ek,i,i{x)ek',i',i'{y)- Let fk,i,i;k' A' m be the corresponding coefficient. We 
shall show that fk,i,i;k' A' .i' = if k ^ k' or I ^ I' and that fk,i,i-k,i' ,i does not 
depend on Z, so that we can set fkA,i:k,i',i = fk.u'- For this we consider the, possibly 
degenerate, F-invariant inner product on C(V) defined by K: 

{f,9)K^\ I f{x)K{x,y)g{y)dfi{x)dfi{y). 
Jv Jv 

For this inner product, the orthogonality relations of Leinma l3 . 3l applv. which proves 
the claim about the coefficients fk,i,i-k',i',i'- Furthermore, the inner product (j3.3p 
defines an inner product on the space spanned by the vectors 

'Pk,i = (efc,i,i, . . . , ek,i,hj, i = 1, . . . , TOfc, 

by 

'^^e-k,i,u'^^ek,i' ,1 j . 

1 = 1 1=1 1 K 

Hence, Fk = {h,ii' ) is positive. □ 

Bochner's characterization is the tool for exploiting symmetry in semidefinite 
programs. If one wants to optimize over all F-invariant kernels, then one only has 
to optimize over positive ^l-'s which have size nik x mfe. The computation of Z^"^^ 
is part of the preprocessing. It can be done by hand, as in the next two sections, 
or by computer if the group is finite and has moderate size. See the article Babai, 
Ronyai j3j for algorithmic aspects. 

4. Boolean harmonics 

In this section we specialize the Peter- Weyl theorem and the characterization of 
Bochner to the n-dimensional Hamming cube {0,1}". We apply these results to 
calculate the theta function for distance graph on {0, 1}". Then we give a complete 
proof of the specialization using Fourier analysis of Boolean functions. 

4.1. Specializations. Recall that the space of continuous functions C({0,1}"), 
also known as complex vectors indexed by {0, 1}", comes with an inner product 

2;e{0,l}" 

which is invariant under the action of the automorphism group. The group has 
order 2"n! and it is generated by permutations and switches 0^1. We make 
extensive use of the functions Xy{^) — which are called characters. The 

character Xy evaluates to 1 if the Hamming distance between x and y is an even 
integer and to —1 if it is an odd integer. 

The Peter- Weyl theorem specializes as follows. 

Theorem 4.1. The space C({0, 1}") decomposes orthogonally into irreducible sub- 
spaces Hk'. 

C{{Q,lY) = Ho ^ Hi ± . . . L H,,. 

Here Hk denotes the subspace spanned by the Junctions Xv{^) — (^l)*'^? with 
Hamming norm \\y\\ = k. It is an irreducible subspace of dimension hk — (^). 
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In the decomposition all irreducible subspaces have multiplicity 1. We have 
Hk = Hk.i in the notation of Theorem 13.51 Hence, Bochner's characterization 
(Theorem I3.6P of the Aut({0, l}")-invariant, positive kernels only involves positive 
semidefinite matrices of size 1x1, i.e. nonnegative scalars. The automorphism group 
acts distance transitively on pairs: For every two pairs of points {x^y) and [x' ,y') 
having the same Hamming distance 5{x,y) = S{x',y') there is a group element 
u e Aut({0, 1}") which transforms the pairs into each other {ux,uy) = {x',y'). So 
Aut({0, l}")-invariant kernels are actually functions depending only on the distance 
S. 

Theorem 4.2. Every Aut{{0,l}^) -invariant, positive kernel K £ C({0, 1}" x 
{0, 1}") is of the form 

n 

K{x, y) = J2 fkKj:{S{x, y)), /o, ...,/„> 0, 

where KJ} denotes a the Krawtchouk polynomial of degree k. 

Krawtchouk polynomials KJ! {t) are orthogonal polynomials for the inner product 

which are normalized by KJ^ (0) — (^) . The first few Krawtchouk polynomials are 

KSit) = 1, 
K^{t) ^ -2t + n, 

K^{t) = 2t^ - 2nt + 

and in general 

!:(:)(::;)(-)■■ 

See the books Szego f57| or Mac Williams, Sloane [S^ for more information on 
Krawtchouk polynomials. 

4.2. Linear programming bound for binary codes. Now the calculation of 
the theta function on the Hamming cube reduces to Delsarte's linear programming 
bound which was introduced by Delsarte |32| . The connection between Lovasz' 
theta function and Delsarte's linear programming bound was first observed by 
McEliece, Rodemich, Rumsey Jr. 76 and independently by Schrijver [5T| . 

^'{{0, iy\ {l,...,d-l}) = min |a : K E C({0, 1}" x {0, 1}") 

positive semidefinite, 
Aut({0, l}")-invariant, 
K{x, = A - 1 for aU x e {0, 1}", 

Kix,y)<-lifS{x,y)e{d,...,n}y 

Now we represent the positive semidefinite matrix K G C({0, 1}" x {0, 1}") which 
is Aut({0,l}")-invariant by Kix.y) = TZ=o hKl{5{x,y)) with /o,...,/„ > 0, 
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using the characterization of Bochner, and get the following linear program in the 
variables /o, . . . 

n 

min{l + ^AXr(0) : /o,...,/„>0, 

fc=o „ 

fe=0 

Exercise 4.3. Calculate i5'({0, 1}", {1, . . . , d — 1}) for, say, all n = 1, . . . , 128 and 
all d. 

4.3. Fourier analysis on the Hamming cube. To show Theorem 14.11 and The- 
orem lT^ we make use of discrete Fourier analysis of Boolean functions. This theory 
has recently been successfully developed and applied to problems concerning thresh- 
old phenomena and influence in combinatorics, computer science, economics, and 
political science. E.g. Hastad's result on the hardness of the stability number, we 
refered earlier to, uses Fourier analysis of Boolean functions. Kalai and Safra [50] 
survey these developments. One can find further information on Boolean harmonics 
e.g. in the book Terras [89], in the paper Dunkl [35], or in Delsarte's thesis [32] . 

A complex- valued function on the n-dimensional Hamming cube / : {0, 1}" ^ C 
has the Fourier expansion 

ye{Q.i}" 

with the characters Xyi^) = (—1)^'^ and the Fourier coefficients f{y). The char- 
acters Xy, y S {0, 1}", which sometimes are called Walsh functions, form an or- 
thonormal basis of the space C({0, 1}"). The one-dimensional subspace spanned 
by each Xy is invariant under the abelian group ({0, 1}", +) of addition modulo 2, 
which corresponds to the switching <-> 1: 

uxy{x) = Xy{x -u) = {-l)y(--) = {-l)v(-)xy{x). 

Under the complete automorphism group of the Hamming cube, i.e. when also 
permutations are allowed, then the one-dimensional subspaces are grouped together 
according the Hamming norm of y. The -dimensional spaces 

fffc = lin{xa : II2/II = fc}, /c = 0, ...,n, 

are Aut({0, l}")-invariant. We shall show that they are irreducible and compute 
the corresponding basis function Zk for Bochner's characterization. 

For this the notion of zonal spherical function is helpful. A function / : {0, 1}" 
C is called a zonal spherical function with pole e G {0, 1}" if / = uf for all u G 
Aut({0, 1}") which stabilize the point e, so that ue — e. 

Lemma 4.4. Let U C C({0,1}") he an invariant subspace and let e G {0,1}". // 
the subspace of zonal spherical functions with pole e which lie in U has dimension 
exactly one, then U is irreducible. 

Proof. Suppose U is not irreducible and U decomposes into nontrivial invariant 
subspaces U — V 1. V-^. Let ei,...,e„ be an orthonormal system of V and 
let fi,...,fm be one of V-^. Then the functions zv{x) — Y^"^iei{e)ei{x), and 
zy±{x) = fi{^)fi{^) two linearly independent zonal spherical functions 

with pole e. □ 
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We apply this lemma to Hk- Let / G Hk be a zonal spherical function with 
pole 0" = (0, . . . , 0). The group stabilizing 0" is the group of all permutations. For 
every permutation u we have uf — f and in terms of the Fourier expansion: 

J{y)Xuy{x) ^ ^ J{y)xy{x). 
ye{o,i}'Ml!/ll=fe ye{o,i}",!|y||=fc 

So the Fourier coefficients /(y) all have to coincide. 

Exercise 4.5. Show that the Hk 's are pairwise not equivalent. 

Let us calculate the basis function Zk{x,x') for Bochner's characterization (and 
in this way the zonal spherical functions of Hk because a zonal spherical function 
with pole e is a multiple of Zk{e, •) as one sees from the proof of Lemma [4.41) . We 
have 

Zk{x,x')= ^ Xu{x)xu{x'). 
\\y\\=k 

Since Zk is invariant under Aut({0, 1}") it only depends on the Hamming distance 
t = S{x,x') and so we can assume that x = 0, and x' = 1*0"~*. A straightforward 
combinatorial calculation yields 

Zkix,x') = Z,(0,1*0"-*) = E(-1)^Q (fc Z ■) = ^kit)- 
Because of Lemma 13.31 we have the orthogonality relation 

j2(fjKTimTi,it) = o, \ik^k'. 

t=o ^ ^ 



5. Spherical harmonics 

In this section we replace the n-dimensional Hamming cube by the unit sphere 
^n-i n-dimensional Euclidean space. First we specialize the Peter- Weyl theo- 
rem and Bochner's characterization. We apply these results to calculate the theta 
function for distance graph on the sphere. This gives tight bounds for the kissing 
number in dimension 8 and 24. Then we give a complete proof of the specialization 
using the theory of spherical harmonics. 

Spherical harmonics have been extensively studied due to their importance in 
mathematics, physics and engineering. Comprehensive information is available. 
Here we follow to some extend the book [2, Chapter 9] by Andrews, Askey and 
Roy, and the paper [2B] by Coifman and Weiss which links spherical harmonics 
with the representation theory of compact topologic groups. A lot of information 
is contained in the book 92] by Vilenkin and Klimyk. Also the book j38j by Miiller 
is very useful. 

Spherical harmonics serve as a full-fledged example of geometric analysis: See the 
books [14j by Berger and [17] by Helgason. The paper by Stanton jSB] emphasizes 
the similarities between spherical harmonics and Boolean harmonics. 
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5.1. Specializations. Recall that the space of continuous functions C(5" ^) comes 
with an inner product 



which is invariant under the action of the orthogonal group 0(M"), the automor- 
phism group of the unit sphere. 

The Peter- Weyl theorem specializes as follows. 

Theorem 5.1. The space of complex-valued continuous functions on the unit sphere 
decomposes orthogonally into pairwise 0{W^) -irreducible subspaces as follows: 



Here Hk denotes the space of homogeneous polynomial functions of degree k which 
vanish under the Laplace operator A — + • • • + . The space Hk has dimension 



Since in the decomposition all irreducible subspaces have multiplicity 1, i.e we 
have Hk = in the notation of Theorem 13. 5i Bochner's characterization of 

the 0(M")-invariant, positive kernels again only involves nonnegative scalars. The 
orthogonal group acts distance transitively on pairs of points on the sphere. So 
0(M")-invariant kernels are actually functions depending only on the spherical dis- 
tance d{x,y) = arccosa; ■ y. It turns out that they are univariate polynomial func- 
tions in the inner products t = x ■ y. Schoenberg in [80] was the first who gave this 
characterization. 

Theorem 5.2. Every positive, 0{W^) -invariant kernel K € C{S'"~^ x S"""^) can 
be written as 



The right hand side converges absolutely and uniformly. Here ^\ {t) is the nor- 
malized Jacobi polynomial of degree k with parameters (a, a), where a — {n~ 3)/2. 

The Jacobi polynomials with parameters {a, (3) are orthogonal polynomials for 
the measure (1 - i)"(l -I- tfdt on the interval [-1, 1]. We denote by p[°''^'^ the 
normalized Jacobi polynomial of degree k with normalization P^'^^\l) = 1. The 
first few Jacobi polynomials are 





CiS"^'^) = Hq ^ Hi L H2 ^ . . . 




OQ 



K{x, y) = Y, fkPt"\^ -y), /o, /i, . . . > 0. 



Pr"^(t) = l, 




n 



1 



1 



1 



For more information on Jacobi polynomials we refer to the books [Ij by Abramowitz, 
Stegun, [87] by Szego, |2] by Andrews, Askey and Roy, and |62] by Lebedev. 
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5.2. Linear programming bound for the unit sphere. Before we give a proof 
of these two theorems, we demonstrate how to apply them to calculate the theta 
function for distance graphs on the unit sphere. Again the original semidefinite 
program degenerates to a linear program because C{S"~^) decomposes multiplicity- 
free into Aut(S'"^^)-irreducible subspaces. This linear program was studied by 
Delsarte, Goethals and Seidel in ^33] and by Kabatiansky and Levenshtein in [49] . 
but see also Sloane [29, Chapter 9]. 

A case for which the calculation of the theta function is particularly simple is the 
graph G(S'"^^, (0, tt/2)). Every vertex is adjacent to a pointed half-sphere. Despite 
its simplicity this case already carries all the features of the more complicated 
kissing number cases G(S'^ (0,7r/3)) and G{S^^, (0,7r/3)). 

We show that the vertices of a regular cross polytope (the 2n points ie^, with 
i — 1, . . . ,7i) form an optimal spherical code. We say that N points on the unit 
sphere S"^^ form an optimal spherical code if they maximize the minimal distance 
among all A^-point configuration on S*"^^. 

Using Theorem [5J] the theta function ?9'(G'(S'""\ (0,7r/2))) simplifies to 



OO 



inf{l + ^/feP("^")(0) : /o,/i,...>0, 



oo 

E 

fc=0 



/fcPi"'")(cos0)<-lif(?e[7r/2,^]}. 



Consider a polynomial F{t) — X]fe=o /*:-^fe" (^) "^ith nonnegative coefficients 
/fc. If F{t) < -1 for aU t e [-1,0], then 

F(0) + 1 > d'iGiS--\ (0, n/2))) > a(G(5"-\ (0, n/2))). 

We give an explicit F which proves the sharp bound of 2n. Generally, if a 
polynomial proves a sharp bound, then all inequalities in Theorem 12.71 have to be 
equalities. So we have F{t) = —1 for all t which occur as inner product. 

We make the Ansatz 

.2 1 



Fit) = /o + fit + f2 -t' - 



71 — 1 n — 1 
and in order that the bound is sharp we have to have 

1 + F{1) 2n, 

m = -1, 
F{-1) = -1. 

So we consider the system of linear equations 

'1 1 
1 

which happens to have the nonnegative solution /q = 0, /i = n, /2 = n — 1. 

Exercise 5.3. Why does this show that the vertices of a regular cross polytope form 
an optimal spherical code? 

Exercise 5.4. Show that Ty ^ l?'(G(5^ (0,7r/3))) = 240. 







V] 

ri-1 1 




1 / 






22 



FRANK VALLENTIN 



Exercise 5.5. Show that i}'{G{S^'^, (0,7r/3))) = 196560. 

Exercise 5.6. Show that the vertices of the icosahedron form an optimal spherical 
code. 

For more on optimal spherical codes see [13], [53], [31], ^7\, [TT], [7]. Recently, 
Wang 94j gave a list of conjectural optimal spherical codes up to dimension 8 and 
up to 27 points. It is a challenge to find proofs of their optimality. 

5.3. Harmonic polynomials. To establish the specialization of the Peter-Weyl 
theorem for the case of the unit sphere we consider harmonic polynomials. 

For the vector space of complex polynomial functions / : C" C which are 
homogeneous and are of degree d we write Pol=(i(C"). Each / G Pol=£i(C") 
satisfies the equation f{ax) = a'^f{x) for all a G C and x G C". The di- 
mension of Pol=rf(C") is C^^'^^)- We define the polynomial oj G Pol=2(C") by 
uj{xi, . . .,Xn) =xl~\ h a;2. 

Furthermore, we introduce two differential operators: The Nabla operator 

v = (— r 

dxi dxn 

and the Laplace operator 

which maps Pol=rf(C") to PoLd_2(C"). It will be convenient to have the following 
inner product on Pol=c((C"): 



for which the basis of monomials is an orthogonal basis with 

777 T I . . . 777 I 

m„ mi m„ \ _ ' ' 

\-^l ■''n ■''n I ^1 

This inner product might look mysterious, but it turns out to be very natural if 
one considers polynomials as symmetric tensors as we point out in the appendix. 

It is a simple and helpful fact that multiplication with a; is the adjoint to the 
Laplace operator: 

Lemma 5.7. For all f G Pol=(i-2(C") and for all g G Pol=d(C") we have 

(c./,5) = (/,Ag). 

Proof. This follows immediately from the obvious fact that the evaluation map / ^ 
/(V) is an algebra isomorphism between the algebra of polynomials C[a;i, . . . , Xn] 
and the algebra of differential operators . . . , jf-]. In particular, this implies 

(c./)(V) = /(V)c.(V), and hence {cjf, g) - if{V){Kg) = (/, A.g). □ 

The kernel of the Laplace operator is the space of harmonic polynomials which 
we denote by 

Harmrf = {/ G PoLrf(C") : A/ = 0}. 
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Theorem 5.8. The space Pol=d(C") decomposes orthogonally into harmonic poly- 
nomials as follows: 

PoLrf(C") = Harnid _L w Harni(i-2 ^'^ Harmrf_4 _L . . . . 

The dimension o/Harmfe, which we denote by hk, is equal to ("j'^'!^^) ^ 

Proof. We first show that A maps Pol=d(C") onto Pol=d_2(C"): By Lemma O 
we have for / e (A(PoLd-2(C"))V and for g e PoUd(C") 

(c^/,5) = (/,Ag)=0, 

hence Lof, and so /, must be 0. The dimension formula for linear maps implies the 
desired formula for hk ■ 

With similar arguments one shows that 

Pol=d(C") = Harmd ± w Pol=d-2(C"). 

Before we can inductively derive the orthogonal decomposition as stated in the 
theorem we only have to verify that [x)Harmrf_2 is orthogonal to lj^ Pol=rf_4(C"). 
The following identity will be helpful: for all p £ Pol=c((C"), we have Aojp — 
2{n + 2d)p + ujAp (in the calculation, notice that X^iLi ^i^rP ~ '^P)- Then, 

{iof,u:^g) = {f,Au;'g) 

= {f,2{n + 2{d-2))ujg + ujAujg) 
= (/,t^(2(n + 2(d-2))5 + Ac^g)) 
= 0, 

for aU / e Harmd-2 and g £ PoUd_4(C"). □ 

By Pol<£;(S'""^) we denote the vector space of complex-valued polynomial func- 
tions on S*"^^ which are of degree at most d. This is a subspace of C{S"^^) so it is 
equipped with the inner product 

if,g) = / f{x)g{x)dujn{x). 

Recall that the measures on 5"^^ and S*"^^ are related by 

duj,,{x) - (1 - xD^^-^^/^dXidcUn^ixj + ■■■+ xl)-^/^(x2, . . . ,X„)). 

The orthogonal group acts on the space of polynomial functions by uf{x) = 
f{u~^x). The space of harmonic polynomials Harm^ is invariant under this action. 
Hence, we have for all / G Harm^j and all u G 0(M") that uf G Harnic;. This 
follows from the fact that A{uf) = u{Af). Another more conceptual way to 
see that Harmrf is an 0(M")-invariant space uses the fact that the inner product 
(•, •) is invariant under the action of 0(]R"). We will give a proof of this in the 
appendix where the connection between symmetric tensors and polynomials will 
also make the inner product (•, •) more transparent. It is obvious that the summand 
ti^ PoLrf_2(C") in the decomposition (15. 3|) is an 0(M")-invariant space. Then, its 
orthogonal complement, Harm^, is an 0(M")-invariant space, too. 

The space of spherical harmonics 

Hk = {/|5'-i : / e Harmfe} 

is also an 0(M")-invariant space. The spaces Harmfe and are equivalent as 
0(M")-invariant spaces: the restriction map </?(/) = /|S"-i is a bijective linear map 
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ip : Harm/c Hk with f{uf) — ULp{f) for all / G Harnifc and all u e 0(R"). Note 
that the inverse map of (f is given by ip^^{f){x) = • 

The following construction will turn out to be important: For an orthonormal 
system ei, . . . , ejn. of Hk we define the function 

Zfe : ^"-1 X 5-"-! -> C by Zfe(x, = ^ e,(a;)e7(y). 

i=l 

Proposition 5.9. The following properties hold for the Junction Zk-' 

(a) Zk does not dependent on the choice of the orthonormal system of Hk- 
(h) For all u G 0(R") and for all x,y S"'~^ we have Zk{ux,uy) = Zk(x,y). 

Proof, (a) Let e'l, . . . , e'^^ be a second orthonormal system of Hk- Let u be the 
orthogonal matrix u — (uij) £ 0(M'''=) with uci — e'^, i ^ 1, . . . , hk- Then, 

hk hk 

= ^(.uei)ix){uei){y) 

i=l 1=1 

hk hk 

= J2'^^3^ej{x)ujiej{y) 

i=i j=i 

hk hk 

= '^[^U3^Uji^ej{x)ej{y) 
j=i i=i 

hk 

J=l 

(b) Since the inner product on Pol<d(S'"~^) is invariant under 0(R"), it follows 
that u^^Ci, i ~ 1, - - - ,hk is an orthonormal system of Hk- Now (b) is a consequence 
of (a). □ 

As in the case of Boolean harmonics we make use of the concept of zonal spherical 
functions to prove the irreducibility of Hk- A function / : S"~^ ^ C is called a 
zonal spherical function with pole e £ S"~^ if / = u/ for all u £ Stab(0(R"), e) = 
{u G 0(M") : lie = e}. Property (b) of Proposition 15.91 implies that x ^ Zk{e, x) is 
a zonal spherical function with pole e. In direct analogy to Lemma 14.41 we have: 

Lemma 5.10. Let U C Pol<(i(>S'"^^) be an 0{M") -invariant space and let e G 5'"""'^ 
be a point on the sphere- If the dimension of the space of zonal spherical functions 
with pole e which lie in U is exactly one, then U is an 0(W^) -irreducible space- 
Now the specialization of the Peter- Weyl theorem given in Theorem 15.11 follows 
form the following theorem. 

Theorem 5.11. The space of complex-valued polynomials restricted to the sphere 
which are of degree at most d decomposes orthogonally into the spaces of spherical 
harmonics: 

Pol<d(5"-i) = Ho±Hi±---±Hd- 
The Hk are 0{W^)- irreducible spaces which are pairwise inequivalent. 
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Proof. Using Lemma [5.101 we show that the Hk's are irreducible. Let / € Hk be a 
zonal spherical function with pole e — (1, 0, . . . , 0)*. We consider / as a polynomial 
in Harm/i; . Then we can write 

k 

f{xi, . . . , a:„) = ^ x'l~'-pi{x2, . . . , Xn), 

i=0 

where pi is a homogeneous polynomial of degree i which is invariant under the ac- 
tion of the stabilizer group Stab(0(R"), e). That means that pi is a radial function, 
a function which only depends on the norm of {x2, . . . , a;„). Because pi is homoge- 
neous of degree i we have Pi{x2, . . . , x„) — Ci{x'^ + ■ ■ ■ + x'^Y^'^ for some constant 
Ci. Hence, Pi = whenever i is an odd integer. So we can write 

k/2 

fixi, ...,Xn) = Yl + • • • + xly. 

1=0 

Since / is a harmonic polynomial we have (check this) 

k/2 

= A/ = ^(a,c, + /?,c,_i)x^2*(a;2 + ■■■ + xly-\ 
1=1 

where = 2i{n + 2i-3), 13^ = {k - 2i + l){k - 2i + 2), and so a = (-l)*co|p^, 
for i = I, . . . ,k/2, which shows that / is determined by cq only. Thus, the space of 
zonal spherical functions is one-dimensional. 

If n > 2, then, by looking at the dimension hk, it follows immediately that Hk 
are pairwise inequivalent and also the orthogonality relation between them. The 
case n = 2 we leave as an exercise (see below). □ 

Exercise 5.12. Make the link between the spherical harmonics for the unit circle 
and the classical Fourier series of 2t: -periodic functions. Show how the the functions 
cos kx, sin kx, with k = 0,1,2, ... , appear as orthonormal system for the spaces Hk . 
Show that Hk and Hk' with k ^ k' are not equivalent. 

5.4. Addition formula. To establish Bochner's characterization for the sphere 
we still have to calculate Zk{x,y). For this we do not need to write down an 
orthonormal system of Hk explicitly. Studying the orthogonality relations and 
their interplay with zonal spherical function suffices here. 

Theorem 5.13. Let ei, . . . , e/j^ be an orthonormal system of Hk. For a;, y G S"~^ 
we have the addition formula 

Pi'-''''^'-''''\x-y)=akj:e,{x)W), 

1=1 

for a positive constant ak . 

Proof. Let Zk and be zonal spherical functions for Hk and Hk' , with k ^ k' , both 
with pole e — (1,0, .. . ,0). By the orthogonality relation between nonequivalent 
0(]R")-irrcducible subspaces it follows 

{zk,Zk')= / Zk{x)zk'{x)dLj{x) ^ 
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The functions Zk and Zfc/ are invariant under the stabiUzer group Stab(0(M"), e). 
So they only depend on the first coordinate. One can use (|5.3p to express the 
integral above as one involving only xi: 

nl 

Zk{xi, . . .,Xn)zk'{xi, . . .,Xn){l - xf)'^'^^^^ dxi = 0. 



'1 

Hence, the zonal spherical functions z^. are multiples of Jacobi polynomials: 
and hence 

1=1 

The constant is positive because p^("~^/^)'("~-^)/^^(i) = i. □ 

Appendix A. Symmetric tensors and polynomials 

In this section we write V for C". The symmetric group Sd of permutations 
a : {1, . . . , d} ^ {1, . . . , d} acts on V^'^ by permuting positions: 

Cr{vi IS) ■ ■ ■ iSi Vd) ^ Vc-i(l) ® • ■ • ® Vcr-i(d), 

with a Cz Sd- The subspace of symmetric d-tensors over V is 

SdiV^^) ^ {u £ V'^'^ : au ^ u for aU a £ S^}. 
The inner product on V^"^ is given by 

d 

(wi €5 • ■ • ® Wd, wi €5 • • • (g) Wd) = • Iwi. 

4=1 

Let ei, . . . , e„ be the standard basis of C". Define the linear map 

7r : T/«5'i Pol=d(C"), by e^, S ■ ■ ■ S e,, ^ x,, ■ --x,^. 

For an n-tupel (mi, . . . , m„) of nonnegative integers with mi + • ■ • + m„ = d define 
the symmetric monomial 

•S(mi,...,m„) — ^ ^ Cii C$ ' ■ ' 05 , 
(il,...id) 

where we sum over all (ii, . . . ,id) so that the number j G {1, . . . ,n} occurs in 
(ii, . . . , ifj) with multiplicity ruj, e.g. 

S(l,2) = ei ® 62 ® 62 + 62 ® ei ® 62 + 62 ® 62 ® 61. 

The symmetric monomials form a basis of S'(j(T^). We have 

d\ 



71'(S(mi,...,m„)) 



I I 1 

mi! • • • m„! 



Hence, the map tt restricted to symmetric d-tensors over V is an isometry due to 
the specific choice of the inner product (•, •) in Pol=d(C"). 

The orthogonal group 0(R") acts on V^'^ by u{vi ® • • ■ ® Vd) — uvi S ■ ■ ■ <E) uvd, 
with u G 0(M"). Let us verify that tt commutes with the action of 0(R"), i.e. 
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■7t{us) — aTT{s) for all s G Sd{V) and u = {uij) G 0(M"). It suffices to consider the 
basis S(mi,...,m„)- We have 

7r(wS(mi,...,m„)) = 7r(^ ^ Ue^j (g) ■ • ■ (g) liCj^ j 
(*!,■■■, id) 

n 

(ii,...,id) Jl,--- Jd = l 

n n 

j=l j=l 

d! 



^Jd 



mi! • ■ • TO„! 



= "7r(s(mi,...,m„))- 

We finish by showing that the inner product on PoLd(C") is invariant under 
the action of the orthogonal group. For this note that the inner product on V'^'^ 
is invariant under this group action. From the previous considerations we have for 
all u e 0(M") the chain of equalities: 

{uf,ug) = (7r"^(u/),7r"^(ug)) 
^ {U7r-\f),u7r-\g)) 

= {f,9)- 



Appendix B. Further reading 

In the last years many results were obtained for semidefinite programs which are 
symmetric. This was done for a variety of problems and applications: 

— interior point algorithms (Kanno, Ohsaki, Murota, Katoh 51 , de Klerk, 
Pasechnik [53], Murota, Kanno, Kojima, Kojima (69j). 

— polynomial optimization (Gatermann and Parrilo [37j and Jansson, Lasserre, 
Riener, Theobald [iS^, Bosse [T7, Laurent [5T]). 

— truss topology optimization (Bai, de Klerk, Pasechnik, Sotirov [TU]!. 

— quadratic assignment problem (de Klerk, Sotirov [57]), 

— fast mixing Markov chains on graphs (Boyd, Diaconis, Xiao [19], Boyd, 
Diaconis, Parrilo, Xiao [T8]). 

— graph coloring (Gvozdenovic, Laurent [33], [33], Gvozdenovic [32]), — cross- 
ing numbers for complete binary graphs (de Klerk, Pasechnik, Schrijver 

ED), 

— travelling salesman problem (de Klerk, Pasechnik, Sotirov [5S]). 

— coding theory (Schrijver 82J, Gijswijt, Schrijver, Tanaka [39], de Klerk, 
Pasechnik [54,, de Klerk, Newman, Pasechnik, Sotirov [S2], Laurent [50] . 
Bachoc [3], Vallentin [H], Bachoc, Vallentin [5]), 

— low distortion embeddings (Vallentin [SD]) 

— geometry (Bachoc, Vallentin [5], [B], [7], Bachoc, Nebe, de Oliveira Filho, 
Vallentin [5]). 
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